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Abstract 

This paper is motivated by two applications, namely i) generalizations of cuckoo hashing, 
a computationally simple approach to assigning keys to objects, and ii) load balancing in 
content distribution networks, where one is interested in determining the impact of content 
replication on performance. These two problems admit a common abstraction: in both 
scenarios, performance is characterized by the maximum weight of a generalization of a 
matching in a bipartite graph, featuring node and edge capacities. 

Our main result is a law of large numbers characterizing the asymptotic maximum weight 
matching in the limit of large bipartite random graphs, when the graphs admit a local weak 
limit that is a tree. This result specializes to the two application scenarios, yielding new 
results in both contexts. In contrast with previous results, the key novelty is the ability to 
handle edge capacities with arbitrary integer values. 

An analysis of belief propagation algorithms (BP) with multivariate belief vectors un- 
derlies the proof. In particular, we show convergence of the corresponding BP by exploiting 
monotonicity of the belief vectors with respect to the so-called upshifted likelihood ratio 
stochastic order. This auxiliary result can be of independent interest, providing a new set 
of structural conditions which ensure convergence of BP. 
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1 Introduction 



Belief Propagation (BP) is a popular message-passing algorithm for determining approximate 
marginal distributions in Bayesian networks [23] and statistical physics |21] or for decoding 
LDPC codes [25J. The popularity of BP stems from its successful application to very diverse 
contexts where it has been observed to converge quickly to meaningful limits [28], [20] . In 
contrast, relatively few theoretical results are available to prove rigorously its convergence and 
uniqueness of its fixed points when the underlying graph is not a tree [1]. 

In conjunction with the local weak convergence [2], BP has also been used as an analytical 
tool to study combinatorial optimization problems on random graphs: through a study of its 
fixed points, one can determine so-called Recursive Distributional Equations (RDE) associated 
with specific combinatorial problems. In turn, these RDEs determine the asymptotic behaviour 
of solutions to the associated combinatorial problems in the limit of large instances. Represen- 
tative results in this vein concern matchings [5], spanning subgraphs with degree constraints 
|26| and orientability of random hypergraphs [18]. 

All these problems can be encoded with binary values on the edges of the underlying graph 
and these contexts involve BP with scalar messages. A key step in these results consists in 
showing monotonicity of the BP message-passing routine with respect to the input messages. 
As an auxiliary result, the analyses of [26] and [18] provide structural monotonicity properties 
under which BP is guaranteed to converge (when messages are scalar). 

The present work is in line with [26], [18] and contributes to a rigorous formalization of the 
cavity method, originating from statistical physics [22], [16], and applied here to a generalized 
matching problem [19] . The initial motivation is the analysis of generalized matching problems 
in bipartite graphs with both edge and node capacities. This generic problem has several 
applications. In particular, it accurately models the service capacity of distributed content 
delivery networks under various content encoding scenarios, by letting nodes of the bipartite 
graph represent either contents or servers. It also models problem instances of cuckoo hashing, 
where in that context nodes represent either objects or keys to be matched. 

Previous studies of these two problems [181 Ej essentially required unit edge capacities, 
which in turn ensured that the underlying BP involved only scalar messages. It is however 
necessary to go beyond such unit edge capacities to accurately model general server capacities 
and various content coding schemes in the distributed content delivery network case. The 
extension to general edge capacities is also interesting in the context of cuckoo hashing when 
keys can represent sets of addresses to be matched to objects (see Section l3T|) . 

Our main contribution is Theorem [H a law of large numbers characterizing the asymptotic 
size of maximum size generalized matchings in random bipartite graphs in terms of RDEs. It 
is stated in Section [2J It is then applied to cuckoo hashing and distributed content delivery 
networks in Section [3l providing generalizations of the results in [18J and [17J respectively. 

Besides obtaining these new laws of large numbers, our results also have algorithmic im- 
plications. Indeed to prove Theorem [TJ in Section 0] we state Proposition [2j giving simple 
continuity and monotonicity conditions on the message-passing routine of BP which guarantee 
its convergence to a unique fixed-point. This result is shown to apply in the present context for 
the so-called upshifted likelihood ratio stochastic order. Beyond its application to the present 
matching problem, this structural result might hold under other contexts, and with stochastic 
orders possibly distinct from the upshifted likelihood ratio order, to establish convergence of 
BP in the case of multivariate messages. 

2 Main result 

Let G = (V, E) be a finite graph, with additionally an integer vertex-constraint b v attached to 
each vertex v € V and an integer edge-constraint c e attached to each edge e £ E. 
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A vector x = (x e ) ee E £ ^ E is called an allocation of G if 

Ve £ E, < x e < c e and \/v £ V, x e < b v , 

e&dV 

where dv is the set of edges adjacent to v in G. We also write u ~ v when uv G -E. 

For an allocation x of G, we define the size |x| of x as |x| := ^2 ee ^x e , and we denote by 
M{G) the maximum size of an allocation of G. Our aim is to characterize the behaviour of 
M{G)/\V\ for large graphs G in the form of a law of large numbers as \V\ goes to infinity. 

We focus mainly on sequences of graphs (G n ) n ^ which converge locally weakly towards 
Galton- Watson trees G. In short (we will explain more in detail later), what this convergence 
means is that, if we let R n be a vertex chosen uniformly at random in G n , what R n sees within 
any finite graph distance k looks more and more like the /c-hop neighborhood of the root of a 
Galton- Watson tree as n — >■ oo. Such a tree is characterized by a joint law $ ~ (D, W, {Ci}f =1 ) 
for respectively the degree, vertex-constraint and adjacent edge-constraints (counted with mul- 
tiplicity) of the vertices of G. We always assume that the graphs are locally finite, i.e. D < oo 
a.s. 

To sample a Galton- Watson tree G, we first draw a sample from $ for the root. Then we 
construct at each dangling edge the missing vertex and its other adjacent edges (therefore maybe 
creating new dangling edges), until no dangling edge remains. Independently for each dangling 

edge and conditionally on its capacity cq, we draw a sample (D, W, {Ci}f =1 \co) ~ <J>(-|co) for the 
number of other adjacent edges (not counting the dangling edge), the^ capacity of the vertex, 
and the other adjacent edge-constraints. Specifically, the distribution $ is given by 

s ,t ,r s ~ - , b, {cp, ci, ■ ■ ■ , c J _ 1 })(l + Zti Igj = Q>)) 

<P(Gt — 1, 6, iC±, . . . , Cr , Cn = -j— : . 

E (dMci ,..., Cd _ l}) HdA {co, . . . , c^DU + Eti 1(Q = co)) 

The construction above can be extended to bipartite graphs G = (AUB,E). In that case, there 
are two laws <S> A and <S> B for the characteristics {D A , W A , {C A }f^) and (D B , IF B , {Cf }g) of 
vertices in A and 1? respectively. These verify the consistency relation for all edge capacities c: 



D J 



E[D A ] 



c = 



1=1 



i=l 



The construction then alternates between <& A and <J> B for vertices at even and odd distances 
from the root. 

We define [z]% = max {x, min{y, z}}. Our main result allows to compute the limit A4(& A , § B ) 
of M(G n )/\A n \ when (G n ) raS N converges locally weakly towards a bipartite Galton- Watson tree 
G = {A U B, E) defined by $ A and 

Theorem 1 (Maximum allocation for bipartite Galton- Watson limits). Provided E\W A \ and 
¥*\W B ] are finite, the limit A4($ A , & B ) := lim n ^oo M(G n )/\A n \ exists and equals 



M{<S> A ,$ B )= inf^E 



E[D A ] 



'¥,[D B ] 



E 



min|^,Ef = >,(^) 

w B -Ef=ik B -E^^(ci 



s\ + 



where for alii, (Xi(c),Yi(c)) ceN is an independent copy of (X(c),Y(c)) c€N , and the infimum is 
taken over distributions for (X(c), Y (c)) cgN satisfying the RDE 



Y(c) 



w A -Y J x t {cf) 



1=1 



- c 








D B 


c 






C A = c 


>;X(c) = < 




W B_Y jYi {C B ) 




°0 


- 








i=l 
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Remark 1. A similar result holds when the graphs are not bipartite; the limiting tree is then 
simply a Galton-Watson tree described by a joint distribution <3?. We set & A = <& B = and 
the formula in Theorem [7] then computes lim n _ i . 00 

3 Applications 

We now apply Theorem[T]to performance analysis of generalized cuckoo hashing and distributed 
content-delivery networks. 

3.1 Cuckoo hashing and hypergraph orientability 

Cuckoo hashing is a simple approach for assigning keys (hashes) to items. Given an initial 
collection of n keys, each item is proposed upon arrival two keys chosen at random and must 
select one of them. Depending on the number m of items and the random choices offered to 
each item, it may or may not be possible to find such an assignement of items to keys. In the 
basic scenario, it turns out that such an assignement will be possible with probability tending 
to 1 as to, n — > oo for all m = \rn\ with r < ^. 

The basic problem can be extended in the following meaningful ways: 

• each item can choose among h > 2 random keys [9l fl"2| [T3] ; 

• each key can hold a maximum of k items [10\ [6j [TTJ ; 

• each item must be replicated at least I times |14t [T8] ; 

• each (item, key) pair can be used a maximum of r times (not covered previously) 

the basic setup corresponding to (h, k, I, r) = (2, 1, 1, 1). We let k h r be the associated thresh- 
old, i.e. if to = [rn\ with r < T^ klr then an assignement of items to keys satisfying the 
conditions above will exist with probability tending to 1 as m,n — > oo; on the contrary, if 
T > T h k i r> t nen the probability that such an assignement exists will tend to as to, n — > oo. 

An alternative description of the present setup consists in the following hypergraph orienta- 
tion problem. For h € N*, a /i-uniform hypergraph is a hypergraph whose hyperedges all have 
size h. We assign marks in {0, . . . , r} to each of the endpoints of a hyperedge. For I < h in 
N*, a hyperedge is said to be (I, r)-oriented if the sum of the marks at its endpoints is equal to 
I. The in-degree of a vertex of the hypergraph is the sum of the marks assigned to it in all its 
adjacent hyperedges. For a positive integer k, a (k, I, r)-orientation of a /i-uniform hypergraph 
is an assignement of marks to all endpoints of all hyperedges such that every hyperedge is (I, re- 
oriented and every vertex has in-degree at most k; if such a (k, I, r)-orientation exists, we say 
that the hypergraph is (k, I, r)-orientable. We now consider the probability space H nm h OI the 
set of all /i-uniform hypergraphs with n vertices and to hyperedges, and we denote by H. n , m ,h a 
random sample from Ji n ^m^h- ln this context, we can interpret Theorem [1] as follows: 

Theorem 2 (Threshold for {k, Z, r)-orientability of /i-uniform hypergraphs). Let h,k,l,r be 
positive integers such that k,l > r, (h — l)r > I and k + {h — 2)r — I > (i.e. at least one of the 
inequalities among k > r and (h — l)r > I is strict). We define <fr A and by (h, I, {r}) ~ & A 
and (Poi(r/i), k, {r}) ~ &f , and 

T^ Altr = S np{r:M^ A ,^)<l}. 

Then, 

( I if T < T* 

lim F (H„ \-„ \ h is (k,l,r)-orientable) = < „ .„ h,k,l,r 

n^oo V n,lrn}A J [0 if T > 

This result extends those from [18j . where the value of the threshold T^ kll was computed. 
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3.2 Distributed content delivery network 

Consider a content delivery network (CDN) in which service can be given either from a powerful 
but costly data center, or from a large number of small, inexpensive servers. Content requests 
are then served if possible by the small servers and otherwise redirected to the datacenter. One 
is then interested in determining the fraction of load that can be absorbed by the small servers. 
A natural asymptotic to consider is that of large number m of small servers with fixed storage 
and service capacity and large collection n of content items. 

The precise model we consider follows the statistical assumptions from [T7j. It is described 
by a bipartite graph G = (AU B,E), where A is the set of servers and B the set of contents, 
\A\ ~ \B\t. An edge in E between a server s in A and a content c in B indicates that server s 
stores a copy of content c and is thus able to serve requests for it. 

An assignement of servers to requests corresponds exactly to an allocation of G provided 
the vertex-constraint at server s is its upload capacity, the vertex-constraint at content c is its 
number of requests cj c , and the edge-constraint is oo. Thus, M(G) is the maximum number 
of requests absorbed by the small servers. Assuming & A is the distribution of storage and 
upload capacity of the servers and $ s the distribution of number of replicas and requests of 
the contents, then A4(& A , <fr B ) computed from Theorem [1] is the asymptotic maximum load 
absorbed by the servers (in number of requests per server). This represents a generalization of 
the results in |17j which handled only servers with unit service capacity, while our result applies 
to any capacity distribution with finite mean. 

Furthermore, the addition of edge capacities also allows us to model more complex cases. 
Suppose that all contents may have unequal sizes, say the size of a randomly chosen content is 
a random variable L, and that each content is fragmented into segments of constant unit size. 
The storage and upload capacity of the servers is then measured in terms of size rather than 
number of contents, and the servers now choose which content and also which segment they 
store. 

Assume further that when a server chooses to cache a segment from content c, instead of 
storing the raw segment it instead stores a random linear combination of all the l c segments 
corresponding to content c. Then, when a user requests content c it needs only download a coded 
segment from any l c servers storing segments from c, as any l c coded segments are sufficient to 
recover the content c. An assignement of servers to requests still corresponds to an allocation of 
G, with the vertex-constraints at servers unchanged, the vertex-constraints at content c equal to 
uj c l c and the edge-constraints linked to a content c equal to u c . Indeed a given encoded segment 
can be used only once per request of the corresponding content. Then, letting <& A and <3? B be 
the appropriate joint laws, A4(& A ,$ B ) is the asymptotic maximum absorbed load (in number 
of fragments per server). 

One could then follow the same path as in [17J and determine the replication ratios of 
contents based on a priori knowledge about their number of requests so as to maximize the load 
asymptotically absorbed by the server pool; this is beyond the cope of the present paper. 

4 Main Proof Elements 

We start with a high level description of this section. The proof strategy uses a detour, by 
introducing a finite parameter A > playing the role of an inverse temperature. For a given 
finite graph G, a Gibbs distribution /j,q is defined on edge occupancy parameters x (Section 
14. 1[) such that an average under approaches the quantity of interest M(G)/\V\ as A tends to 
infinity. Instead of considering directly the limit of this parameter over a series of converging 
graphs G n , we take an indirect route, changing the order of limits over A and n. 

We thus first determine for fixed A the asymptotics in n of averages under l^Q n - This is 
where BP comes into play. We characterize the behaviour of BP associated with on finite G 
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(Section 22]) i establishing its convergence to a unique fixed point thanks to structural properties 
of monotonicity for the upshifted likelihood ratio order, and of log-concavity of messages (Sec- 
tions S3] and H31). This allows to show that limits over n of averages under are characterized 
by fixed point relations a la BP. Taking limits over A — > oo, one derives from these fixed points 
the RDEs appearing in the statement of Theorem 1. It then remains to justify interchange of 
limits in A and n. These last three steps are handled similarly to [18] (see appendix). 

Before we proceed we introduce some necessary notation. Letters or symbols in bold such as 
x denote collections of objects (£«)«<=/ for some set /. For a subset S of /, xg is the sub-collection 
(xi)izs an d |xs| := Ylii£S x i ls the Li-norm of xg. Inequalities between collections of items 
should be understood componentwise, thus x < c means x% < Cj for all i G I. For distributions 
nii, we let mg(x) := Y\ ieS rrii(xi). When summing such terms as in ^xgN s -|x|<f> x <c m s( x )> we 
shall omit the constraint x G N s . Similarly, we let *sm = *i e s m i- 



4.1 Gibbs measure 

Let G = (V,E) be a finite graph, with collections of vertex- and edge-constraints b = (b v ) ve y 
and c = (c e ) eg £;). The Gibbs measure at temperature parameter A G M+ on the set of all 
vectors in N E is then defined, for x G N , as 

Mg(x) = ^7TT A ' X|l ( x allocation of G) = ~4tt^ |x| II Xe ^ b ^ U ^ ^ Ce )' 

G ^ G ^ ' veV eddv e€E 

where Zq{\) is a normalization factor. 

When A — > oo, /i^, tends to the uniform probability measure on the set of all allocations of 
G of maximum size. Thus, lim,\-»oo A t G r (l-^l) = M(G), where /Xq(|X|) is the expected size of a 
random allocation X drawn according to /i^. Hence, we can compute M(G)/\V\ as follows: 



1 1 \v€V 1 1 / 



\eedR 



(1) 



where R is a root-vertex chosen uniformly at random among all vertices in V, and the first 
expectation is with respect to the choice of R. 



4.2 Associated BP message passing 

We introduce the set E of directed edges of G comprising two directed edges ut> and for each 
undirected edge uv G E. We also define dv as the set of edges directed towards vertex v G V, 
^h) as the set of edges directed outwards from v, and d~£ := {wi) w <zQ v \ u if ~£ is the directed 
edge v&. 

An allocation puts an integer weight on each edge of the graph. Accordingly the messages to 
be sent along each edge are distributions over the integers. We let V be the set of all probability 
distributions on integers with bounded support, i.e. 

V = \ p G [0, 1] N ; ^2p(i) = 1 and 3k G N such that p(i) = 0, Vi > k \ , 
I ieN J 

and V the set of distributions in V whose support is an interval containing 0. 

A message on directed edge e with capacity c e is a distribution in V with support in 
{0, ...,c e }. The message to send on edge outgoing from vertex v is computed from the 
messages incoming to v on the other edges via 

v {X) lm]M ._ ^l(x<c TO )E| y |<6,-,m^(y) 

2^t<c vu A 2^\y\<b v -t U1 dUti [ -' y ) 
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where we introduced the operator 72.^ : V d ~^ — > V . For notational convenience, we write 

TZ^[m] instead of TZ^[mg-^]. We also introduce IZ-g for TzQ? . The two operators are linked 
via the relationship 



^ [ K j E ( > A^H(t)' 

We also define an operator T> v : "P — > R + meant to approximate the average occupancy at 
a vertex v under /j,q from the messages incoming to v : 

V, , <h Ix|m7r>(x) 



£|x|<6„ m ^( x ) 

Finally we denote by TZq^ the operator that performs the action of all the 1Z^} for all ~& 

,(A)r i_/t,(A) 



simultaneously, i.e. 1Z G ' [m] = \ [m]J_^ _^ (the same type of notation will be used for other 
operators). It is well known that belief propagation converges and is exact on finite trees |21j : 

Proposition 1. In a finite tree G, the fixed point equation m = 7?^[m] admits a unique 
solution m( A ) € V , and it satisfies for every vertex v: 

\eedv / 

However, to be able to take the limit as the temperature parameter A goes to infinity as well 
as to deal with cases when G is not a tree anymore, we need to study further the operators IZ-g 
and V v , which we term the local operators. 

4.3 Structural properties of local operators 

In this section, we focus on the one-hop neighborhood of a vertex v of a graph G, i.e. on vertex 
v and its set dv of adjacent edges. We thus only consider the directed edges in dv U ch). We let 
b v be the vertex-constraint at v and c = (c e ) eg g 1 , be the collection of the edge-constraints on 
the edges in dv. 

Among the many stochastic orders studied for comparing distributions (see e.g. |23j). the 
one adapted to the structure of operators IZ-g and V v is the so-called upshifted likelihood-ratio 
stochastic order (abbreviated ht)- For two distributions m and m! in V, we say that m is 
smaller than m' (for the lr f stochastic order) and we write 



m <i r ^ m! if m(i + k + l)m'(i) < m(i + l)m'(i + k),\/i, k,l £ N. 

m(i) 



In particular, if m and m' have the same interval as support, we have m < lrf w! O ^± < 



m fetj^ , for all i for which the denominators are non-zero. 

We shall also need the following definition. A distribution (pj)j>o is log-concave if its 
support is an interval and PiPi+2 < Pi+\-> f° r au * £ N. This property has strong ties with the 
lr f-order. In particular one can note that p is log-concave if and only if p <\ r ^ p. We let V\ c C V 
be the set of all log-concave distributions over integers with finite support, and V\ c = V n V\ c : 

'Pic = I P £ [0, 1] N ; ^^p(z) = lj P is log-concave, and 3A; G N such that p(z) = 0, Vi > A; > . 

I iGN J 

The key result of this Section is then the following: 
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m 2 . 



Proposition 2 (Monotonicity of the local operators for the lrf-order). The operator 1Z\ 
is non-increasing; furthermore, if the inputs of TZ^U are log-concave, then the output is also 
log-concave. The operator T>^ is non- decreasing, and strictly increasing if all its inputs are 
log- concave with in their support. 

The proof will rely on the following lemma from [27] establishing stablity of lr f-order w.r.t. 
convolution: 

Lemma 1. For a set ~3 of directed edges, if <^ m 2 ^- in , then *-^m 1 *-g 
We shall also need the following notions: 

• the reweighting of a vector m by a vector p is defined by m . p(x) := ^ m^pfo) ^ or 
x G N, for p and m with non-disjoint supports and \p\ < oo or \m\ < oo. If p or m is in 
V, then m.peV. Note that 7^ } [m] = A N . TZ-^[m], where A N = {X x ) xe N- 

• the shifted reversal of a vector p is defined by p R {x) = p(b v — x)l(x < b v ) for x G N; if 
p G V and its support is included in [0, b v ], then p R G P as well. 

It is straightforward to check that 

Lemma 2. Reweighting preserves the h"\ -order; shifted reversal reverses the lr~\ -order. 

Note that by the previous lemma it suffices to prove the results of Proposition [2] for TZ-g and 
they will then extend to 7zQ . For space reasons, we prove here only the part of the statement 
concerning TZ-^, and only for inputs in V . The rest of the proof is deferred to the appendix. 

Proof. Let ~£ be an edge outgoing from vertex v, and m^,m^ G -p9~i sucn that m^ <i r ^ 
m dlt- Let S [o,b v )( x ) = 1(0 <x < b v ); we have 5 [oM * dl ? m l (x) = Y, x -b v <\ y \<x m k>(y)' <Wl 
is log-concave, so 6[ 0>bv ] <i^ 6[ 0>bv ] and Lemma Q] then implies 8[ 0;bv ] * d -^ m 1 < [rt 5[ 0;bv ] * d -^ m 2 . 
Lemma [2] then says (^[o^] m1 )^ >irt {^[o,b v ] *a~t m 2 )^. It is easy to check that 

ft-^irf] = Vc e ] ■ (<Wl m*)* : ( 2 ) 

and as (^[o^] *g-£ m 1 )^ (0) > Lemma [2] again implies that ^-^[m 1 ] >\ T ^ 7Z-^[m 2 ]. 

If now uig-^ G V\ c e , then m d -g <i r ^ m a -^ and 7£-^[m] >i r ^ 7£-^[m], hence 7£-^[m] G V\ c . □ 



To pave the way for the analysis of the limit A — > oo, we distinguish between two collections 
nessages m^+ and in V and introduce additional operators. For an edge e outgoing 



from ?; we define the operator Q-^ : V — > V by [n] = TZ-^ [A . n] , where A . n = 

(A N . n-^).^. -g. As reweighting preserves the lr f-order, the operator Q^' is non-increasing. 
It also verifies the following useful monotonicity property with respect to A, proven in the 
appendix: 

Proposition 3 (Monotonicity in A). For n 9 -^ G V de , the mapping A h-> Q^[n] is non- 
decreasing. 

As A — > oo, limiting messages may not have in their support. We thus define a-^ as the 
infimum of the support of m-g G V, i.e. a-g = min{x G N : m-g(x) > 0}, and as the 
supremum of the support of n-g G V, i.e. fi-g = max{x G N : n-^(x) > 0}. When there 
may be confusion, we will write a(m-g) and j3(m-^) for the infimum and the supremum of the 
support of m-g. We also extend the definition of the local operators given previously so that 
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they allow inputs with arbitrary supports in N: for an edge e outgoing from vertex v, we define 
: V 9lt ^V,V V :V^^ R+, : V 9lt -> V and 5^ : N 9lt ^ N as 

, 1 ^ Ce )^|y|<'H>- 3; m o^(y) if \n,-+\<h 

n-fhnUx) = I £ t < Ce £| y |«,„- t «w(y) Wl^ " (3) 
' - otherwise 

V v [m] = { Ew^mgjCx) if l°&l<^ (4) 
I 5^ otherwise 

Q^[n](x) = { E t < Ce E| y | =6 „- t n^(y) 11 Wl ^ C e (5) 

I ^ otherwise 
5^(x) = [&,, - I^tH]? ■ (6) 

Note that the support of 7£-^-[m] is {0, ... , S-g-(a)} and that of Q-^[n] is {S-g(/3), . . . , c e }. The 
following result is established in the appendix: 

Proposition 4 (Continuity for log-concave inputs and limiting operators). The operators IZ-g 
and V v given by equations (Ej),|^]) are continuous for the L\ norm for inputs in Vi c . Also, Q-f 
defined in equation satisfies Q-g [n] = lim tA^oo Q§ [n] for any n d -g € V d ~^ . 

It follows naturally that Q-f is non-increasing. Moreover, we can extend the results of 
Proposition [5] to the extended operators, i.e. H-g is still non-increasing and T> v non-decreasing. 

4.4 Convergence of BP on finite graphs 

The main result of this section is the following 

Proposition 5 (Convergence of BP to a unique fixed point). Synchronous BP message updates 
according to m i+1 = 'R.q [m t ] for t > converge to the unique solution of the fixed point 

equation m = TZq [m] . 

Proof. For all ~& £ E initialize the message on ~i£ at = do £ V\ c - As 'R^ is non-increasing 
and So is a smallest element for the lr f order, it can readily be shown that the following 
inequalities hold for all t > 0: 

m Slrt m — lrt m _lrf m 

In other words the two series (m 2t ) t >o and (m 2t+1 ) t >o are adjacent and hence converge to 

respective limits m~, m + such that m <\ r ^ m + . Continuity of further guarantees that 

m + = 7?^(m _ ) and m~ = TZq\ui + ). Moreover, considering any other sequence of vectors 

of messages (m'*)t>o with an arbitrary initialization, since m° <i r -|- m'°, monotonicity of T^q 1 
ensures that for all t > 0, one has 

111 Slrf m j m Slrt m 

The result will then follow if we can show that m + = m~ . 

We establish this by exploiting the fact that T> v is strictly increasing for inputs in V\ c . As 
m <i r ^ m + and T> v is non-decreasing for the lr f-order for all v £ V, it follows T) v [mT] < 
T> v [m + ] for all v £ V . Then, summing over all vertices of G, we get 



m 



^ t^r ExeN™^)^ 111 "]^) Z^yt^v E* 6 N ^ [ m+ ] (z)™^ 0*0 



= Z^Z^ "v^ 1? r +i^ + ( \ = Z> c « m ■ 
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Hence, in fact, T> v [m ] = T> v [m + ] for all v 6 V. As T> v is strictly increasing for these inputs, 
m = m + = m( A ) follows. □ 

We finish this Section by stating results on the limiting behaviour of the fixed point of BP 
on a fixed finite graph G as A — > oo. First, adapting the argument of Chertkov [7J which dealt 
only with unitary capacities, we can show 

Proposition 6 (Correctness for finite bipartite graphs). In finite bipartite graphs, 

i lim Ia^oo E ^I m(A) ] = M (°)- 

Remark 2. In view of this proposition, BP can be used as an algorithm to compute the max- 
imum size of allocations in finite bipartite graphs, by running the algorithm at finite tempera- 
ture parameter X, computing T> v [m^] for all v from the fixed-point messages, and then letting 
A — > oo. 

In the non-bipartite case, the fixed-point at finite A admits a limit m(°°\ and the value 
of £„P„[m(°°)] is equal to Y^ v E v (a^°°^), where F v is defined in the propositions below (whose 
proof is in the appendix). This sum is computed from the infimum a^ 00 ) of the support of 
m(°°). Furthermore, a^ 00 ) can also be obtained from a fixed-point equation, of which it is the 
solution that gives the lowest value of F v . 

Proposition 7 (Limit of A — > oo). m( A ) is non- decreasing in A for the Ir "[-order, and m^ 00 ) = 
lim tA^oo m( A ) € V~f^ is the minimal solution (for the Ir^-order) o/m(°°) = Qq o TZQ[m^°°^]. 

Proposition 8 (BP estimate in finite graphs). In a finite graph G, we have 
where F v (a) = min(6„, |a^|) + (b v - |a^[) + - 

Remark 3. In a finite tree, there is only one possible value for a-g = S-g o Sq-^[q] when it 
is an edge outgoing from a leaf v: it is a-g = min{6„,c e }. It is then possible to compute the 
whole, unique fixed-point vector a = Sq ° Sq(oc) in an interative manner, starting from the 
leaves of the tree and climbing up. This gives a simple, iterative way to compute the maximum 
size of allocations in finite trees, which is the natural extension of the leaf-removal algorithm 
for matchings. 



4.5 Infinite unimodular graphs 

This section extends the results obtained so far for finite graphs to infinite graphs. As in |26tll8|. 
we use for this the framework of pQ. We still denote by G = (V,E) a possibly infinite graph 
with vertex set V and undirected edge set E (and directed edge set E). We always assume that 
the degrees are finite, i.e. the graph is locally finite. A network is a graph G together with a 
complete separable metric space 3 called the mark space, and maps from V and E to H. Images 
in H are called marks. A rooted network (G, r) is a network with a distinguished vertex r of V 
called the root. A rooted isomorphism of rooted networks is an isomorphism of the underlying 
networks that takes the root of one to the root of the other. We do not distinguish between a 
rooted network and its isomorphism class denoted by [G, r\. Indeed, it is shown in pQ how to 
define a canonical representative of a rooted isomorphism class. 

Let G* denote the set of rooted isomorphism classes of rooted connected locally finite net- 
works. Define a metric on G* by letting the distance between [Gi,ri] and [G2,r2] be 1/(1 + 5) 
where 5 is the supremum of those d > such that there is some rooted isomorphism of the balls 
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of graph-distance radius \_d\ around the roots of G{ such that each pair of corresponding marks 
has distance less than 1/d. G* is separable and complete in this metric pQ. 

Similarly to the space Q*, we define the space of isomorphism classes of locally finite 
connected networks with an ordered pair of distinguished vertices and the natural topology 
thereon. 

Definition 1. Let p be a probability measure on Q*. We call p unimodular if it obeys the 
Mass-Transport Principle (MTP): for Borel f : Q** — > [0, oo], we have 



f(G,r,v)dp([G,r]) = J ^ fiG,v,r)dp([G,r\) 



E 



Let U denote the set of unimodular Borel probability measures on Q*. For p £U, we write 
b(p) for the expectation of the capacity constraint of the root with respect to p. Our first 
result (proved in the appendix) is that the BP updates admit a unique fixed-point at finite 
temperature parameter A: 

Proposition 9. Let p with b(p) < oo. Then, the fixed point equation m = 7£^[m] admits 
a unique solution ot^ for any A € K + for p-almost every marked graph G. 

The proof differs from that in the finite graph case in that we cannot sum T> v over all the 

vertices v € V anymore. Instead, we use the MTP for f(G,r,v) = xm ( ^ _s J^^J2. LiL_ 

The rest of the reasoning goes as in the finite graph case and the proofs can be found in the 
appendix (using the MTP again, instead of summing over all directed edges): Proposition [7] is 
still valid and the following proposition is analogous to Proposition [8] 

Proposition 10 (BP estimate in unimodular random graphs). Let p £U with b(p) < oo, 
limtA-»oc / V r [ m ^]dp([G,r}) = [ 2? P [m(°°)]dp([G,r])= / F r (cx^)dp([G,r}) 



ini [ F r (a)dp([G,r}), 

joS G (a) J 



a=SGoSG(ai) 

where F v (a) = min(6„, \(Xg>\) + (b v - |a^|) + . 

4.6 Prom finite graphs to unimodular trees 

Once Pr op osit ion 1 1 1 holds . the end of the proof for sequences of (sparse) random graphs is quite 
systematic and follows the same steps as in [5] , [26] and [18] . We first need to show that we can 
invert the limits in n and A (see Proposition 6 in [18 J: 

Proposition 11 (Asymptotic correctness for large, sparse random graphs). Let G n = (V n , E n ) n 
be a sequence of finite marked graphs with random weak limit p concentrated on unimodular trees, 
with b(p) < oo. Then, 

l im 2A*» = f p r [ m M]dp([G,r]) = inf / F r (<x)dp([G,r}). 

n-s-oc \V n \ J ol=Sg°Sg(ol) J 



The second step uses the Markovian nature of the limiting Galton- Watson tree to simplify 
the infinite recursions a = Sq ° Sg{°<-) into recursive distributional equations as described 
in Theorem [TJ Finally, the fact that the sequence of graphs considered in the introduction 
converges locally weakly to unimodular Galton- Watson trees follows from standard results in 
the random graphs literature (see [15] for random hypergraphs or [8] for graphs with fixed 
degree sequence). 
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5 Appendix: 
5.1 Local operators 

Proposition (Monotonicity of the local operators for the lr "f-order; Proposition [5]). The oper- 
ator IZ-g is non-increasing; furthermore, if the inputs of 7Z-g are log-concave, then the output 
is also log-concave. The operator D v is non- decreasing, and strictly increasing if all its inputs 
are log- concave with in their support. 

Proof. Let ~~£ be an edge outgoing from vertex v, and m^,ni4 G V 9v such that m?-> <i r + m?->. 

dv dv dv dv 

Firstly, if |ct|-^-| > K, then 7£-^[m 2 ] = 5o and automatically >i r ^ 5q = 7£-^[m 2 ]. 

Then, if jo: 2 ,-^! < K, we also have Ioj^-^I < l ^"^ — ^u- ^[0,b v ]( x ) = 1(0 < x < b v ) 

and ffl-t = * dl ?m l ; we have 5 [oM * 9^{x) = T, x -b v <\ y \<x m dit^- "W] is log-concave, so 
S[o,b v ] <irt 6[a,b v ] an d Lemma □ then implies <5[o,&„] * 6»I> < lrt 5[ 0)bv ] * 0\. Lemma [2] then says 
[9^) R >i r ^ [6^) R . It is easy to check that 

n^[m l ] = 6 [0iCe ] . (6 [0jbv] * dl ? m l ) ; 

and furthermore, as (<5[o,6„] *d~£ m ') K (0) > 0, Lemma [2] again implies that >\ v ^ 

If now m 9 -^ G T>f c e , then m 9 -^ m d -^ and lZ-g[m] >i r | 7£-^-[m], which shows 7£-^-[m] G 

Pic - ; ' t ' / _ 

Similarly, if \a^.\ > b v then T> v [m 2 } = b v and automatically D^m 1 ] < b v = T> v [m 2 ]. If 

dv 

now \ol%\ < b v , we also have laLJ < b v . Lemma [1] shows 9} = *^m 1 <m- # 2 = *^m 2 . As 

c9u dv ov * ov 

\a l \ < b v , Lemma[2]says <5[o,&„] ■ 9\ <irt ^[o,6„] ■ This implies that the mean of 5[o,&J ■ #J is no 
larger than that of 5[o,fe„] ■ 9 2 , which is exactly XVfm 1 ] < V v [m 2 ]. 

Furthermore, if m^U <i r t m 2 -*. in VP V and Icxi-J = | | = 0, then a direct calculation 

will shows that 7* <i r -|- 7 2 , which implies X^m 1 ] < D„[m 2 ]. More precisely, fix e G 3?;; it 
is sufficient to work with w-^-/ = 101 a H ^ ^ as then the loose inequality obtained 
before allows to conclude. Then, there exists a minimum i G N such that m-+(i + l)m 2 ^(z) < 



m^,{i)m% 

e w e 



(i + 1). It is then immediate that 9^(i + l)6> 2 (i) < 9l(i)9 2 (i + 1) as the only term 
differing between the two sides is the one for which we have strict inequality. This implies 
(9l(z)) <b <irf {@v( x )) x < b > as we already obtained the loose inequality. For these vectors, 
Lemma [2] tells us that reweighting by 5[o,&„] preserves the strict lr ^-ordering, thus we have 
S [0,b v ] ■ 0l <irt s [0,b v ] ■ °l as claimed. □ 

Proposition (Monotonicity in A; Proposition [3]). For Tig-f G V 9 e , the mapping n 
is non- decreasing. 

Proof. Let "e^ be an edge outgoing from v, and n^-^ G "P 9e . First-of-all, the support of the 
vectors considered is independent of A G M + . We will use the expression of 7Z e from equation (J2j) . 
It is easy to check that 

e§> [n] = A N . 5 [0)Ce] . (ff pw *^ ( A N . n) ) * = 6 [0tCe] . ( (a n ) * n 

which shows that [n] is non-decreasing in A (as R is the only operator used which reverses 
the lr t-order instead of preserving it, and it is applied twice to A N ). □ 

Proposition (Continuity for log-concave inputs and limiting operators; Proposition H|). The 
operators IZ-g andT> v given by equations are continuous (for the L\-norm) for log-concave 

inputs with in their support. Also, Q-f defined in equation $E\) satisfies Qrf[v] = lim tA->oo 
Q^} [n] for any n 9 -^ G V d ~^ . 
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Proof. Consider a sequence (m^W^ in converging towards m^, which thus belongs to 

dv Ov 

V^y. Let ~£ be an edge outgoing from v: 

**[-<"]<*) - H ^ C '^'-^ M 



If \otQ-$\ < b v , then the expression above is clearly continous. If \olq-$\ > b, then the numerator 
is equf 
Then, 



is equivalent as k — > oo to J2\ y \=b v m a"^(y) anc ^ the numerator to J2\ y \=b v -x m a"^(y)' ^ or x — Ce - 



o 

A similar reasoning applies to V v : 



lim TZ^[m^>](x) = 8 Q {x). 

k— >OD 



P4<n (t) ] 



^W<fe, l X l m ^( X ) 

If |o!^| < &u then the expression above is clearly continous. If \ot-^\ > b v , then the denominator 
is equivalent to X]|x|=fe ^ v 

m^(x) and the numerator to X)| x |=6 m ^( x )- So, 
lim V v [m (k) ] = b v . 

fc— >oo 

We now turn to the operator Q-g. Let n d -g G V d ~^ , we have 

W _ Kx<c e )A-E| y |< fc ,-, e A^n^ (y ) 

Suppose that 1 > 6„ — c e , then the denominator of the expression above is equivalent as A — > 
oo to A 6 - E6„- Ce <| y |<6„ *w(y) > and the numerator to l(x < c e )A b « E| y |= b „-x e *W(y) + 
o(X bv ). Hence, 

A ^°° 6 £&„- Ce <| y |<&„ n 3lKy) 

Suppose now that |/3g-^-| < ^ — c e . The denominator is equivalent to 

A ce+|^l *w(y)>0 
|y|=l/3 ai H 

and the numerator to 

l(x < c e )A a;+l/3 ^ 1 rw(y) + o(A Ce+l/3 ^ 1 ). 

Thus, 

lim Q^[n](x) = S Ce (x). 

□ 
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5.2 Finite graphs 

Lemma 3. (m 2fc ) fceN and (m 2fe+1 ) fcgN are adjacent sequences in for the Irj-order. We 
can then define m" = lim tfc-s-oo m2fc m+ = nm 4-fc->oo m 2k+1 in . They satisfy 



~ k < frt m+ = lim^ 00 m 2fc+l ^ 



m = TZq [m ] and m = TZ G [m ]. 

Proof. First-of-all, it is clear that m fc € Pic for all k £ N, and for all e , m^ >i r -|- 5o- In 

particular, m 1 >i r ^ m°, and thus m 2 <i r ^ m 1 as TZq is non-increasing; iterating TZq we get 
also m 3 >i r ^ m 2 . Thus, 

2 S 1 

m <i r ^ in <j r ^ m <i r | m . 

Now, it is easy to see that applying TZq o TZ^ to the equation above gives m 2 <i r -|- m 4 <i r -|- 
m 5 <i r ^ m 3 ; keeping iterating TZq o^ a ' thus yields two adjacent sequences as claimed. Taking 
the limit k — > oo in m 2fc <i r -|- m 2fc+1 gives m _ <\ v ^ m + , and similarly letting k — >■ oo in 
m 2fc+i = 7jW[ m 2fc] and m 2fc+2 = K W[ m 2W] yields m + = ^ A) [m-] and m = ftg°[m+] 
respectively, due to Proposition HI □ 

Proposition (Unique fixed point in finite graphs at finite A; Proposition [5]). In a finite graph 
G, the fixed-point equation m = ^^[m] admits a unique solution E v]~ ■ 

Proof. As m~ <i r ^ m + and T> v is non-decreasing for all v E V, it follows X\,[m~] < X\,[m + ] for 
all v G V. Then, summing over all vertices of G we get 

y v War] = yy ^ ,gN xm ^ t" 1 "] ^ 

't&ho E, e N^[m+](x)m±,(x) 

V- V- Ex&N x7l ^i m+ }( x ) m ^( X ) ST<r,I +i 

Hence, in fact, D v [m~] = V v [m + ] for all v £ V. As is strictly increasing for these inputs, 
it follows m~ = m + = m^ A \ 

Finally, if m € V is another solution of m = [m]. Then m >i r ^ m°, and thus by 

iterating TZq we obtain m <\ T ^ m <\ T ^ m + . Thus, m = m^. □ 

We do not include a proof of the following proposition, as it is not part of the development 
towards the main theorems, and is only mentionned here as a related result having interesting 
algorithmic consequences. 

Proposition (Correctness for finite bipartite graphs; Proposition [6|). In finite bipartite graphs, 

ilim Ta->oo £x\,[mW] = M(G). 

Proposition (Limit of A — > oo (Proposition W)). is non- decreasing in A, and m(°°) = 

lim tA^oo m( A ) € is the minimal solution (for the Ir^-order) of = Qg ° T^c^m^ 00 )]. 
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Proof. We first show that A i— > m( A ) and A i— > A N . are respectively non-decreasing and 
non- increasing, where A _N (x) = X~ x for x G N. We proceed by induction: m% = 5q for 

all ~£ G 1^, hence A i— > m° and A i— > A N . m° are constant functions. Suppose now that 
A i — y m k and A h- > A _N . m k are respectively non-decreasing and non-increasing, for some k G N. 
A~ N . m fc+1 = IZq [m k ]. As A ^ m k is non-increasing and TZg is non-increasing, it follows 
A i— > A~ N . m fc+1 is non-increasing. Similarly, let A' > A and call m' the messages associated 
with A'. We have m fe+1 = Q^[A _N . m fc ] <i r ^ Q^[A'~ N . m' k ] because m k is non-increasing 
m A and Qg is non-decreasing. Also, is non-decreasing in A, so Qq\\' N . m' k ] <\ r ^ 

Taking the limit k — > oo, we obtain that A i— >• m( A ) and A H > A~ N . m( A ) are respectively 
non-decreasing and non-increasing. This allows us to define m(°°) = lim tA^oo m( A ) G and 
n (oo) _ lim A~ N . mW G V£ . Passing to the limit A ->• oo in A~ N . = ^ G [m( A )] and 

in mW = Qg°[A~ N . mW], we obtain 

n (°°) =na[iaW] and m(°°> = Q G [n (oo) ]. (7) 

Let m G P be another solution of the two-step equation given by ([7|). We have m-g >\ r ^ 8q, 
hence m 2k <\ r ^ (^Qq^ ° TZq^J [m] <irf m i where the first inequality is obtained by applying the 
non-decreasing operator Qq oTZg k times and the second one follows from the fact Qq oT^cfm] 



is non-decreasing in A, thus [Qq^ ° TV\ [m] <irf [Qq ° T^) K [ m ] = m - Taking the limit k 
yields <i r ^ m, and then A — > oo gives m*- 00 ** <i r -|- m. □ 

The case of infinite unimodular graphs being more general, the proof of the following result 
is included only in this context (see Proposition I10p . 

Proposition (BP estimate in finite graphs; Proposition [8]). In a finite graph G, we have 

lim Ta->oo V D„[m (A) ] = T V v [mW) = V F 9 ( a W) = inf V F v (a), 
vev vev vev «=s G oS G (a) ^ y 

w/iere F (a) = min(&„, \etg>\) + (6„ - |«|^|) + - 
5.3 Infinite unimodular graphs 

Proposition (Proposition [9|). Lei p £ U with b{p) < oo. Then, the fixed point equation 
m = TZ^ [m] admits a unique solution for any A G M + /or p-almost every marked graph 
G. 

Proof. The proof differs from that in the finite graph case in that we cannot sum T> v over all 
the vertices of v anymore. Instead, we use the MTP for f{G,r,v) = ^f gM xm ^j x V^¥t>[ m \( x ) . 

V r [m-}dp([G,r}) = /E v^ L M[G,r]) 



DC 



dp([G,r\) 



V- Y,xm Xn ¥t>[ m+ \( X ) m tt( X ) r nn n 

23 P [m+]dp([G,r]) 
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Because b{p) < oo, these expectations are finite, and as T> v is strictly increasing for all v S V 
it yields that m~> = p-almost surely. By Lemma 2.3 PQ, this result extends to the edges 

dr dr 

incoming to p-almost every vertex in G, hence m~ = m + = m( A ) p-a.s. □ 



Lemma 4. Let m^ € V 9v . We have 

av 

Proof. Both sides are equal to unless > b v . We have 

E W =6,(W-|%(/3)|)n^(x) 



2^|x|=6„ 

= (6„-|%(^)|)l(|^|>6 u ) 
= (6»- 15^09)1)+ 

where the last line follows from the fact <S^(/3) = [b v — + /3-^]o e < P~g if |/3gH > the 
inequality being strict whenever fi-g > 0, and it is then easy to check that Yl*- — 

- (1/3^1 -M<V □ 

Lemma 5. Zei F v (a) = min(6„, |<*g*|) + (b v — l Q |^,l) + > ^ v e ^- Consider m' = Qg o 7£<2[m] 
swc/i i/tat ct(m) = a^m'). 

• J/m' > H m, then f V r [m]dp{[G,r}) < J F r (a(m))dp([G,r]). 

• If to! < irt m, then fV r [m]dp([G,r}) > J F r (a(m))dp([G,r]). 

Proof. Let n = T^Gtfm]. For any v £ V, |a^(m)| < b v is equivalent to |/3^(n)| > b v . We have 

J V r [m]dp{[G,r]) = J ( min(b r , \ag(m)\) 

+ iflosWi < ME ^^g, J«MG.'D 

Furthermore, for any v ~ r, ExeN^ - a(m^))m^(x)TZ^ [m](x) is non-zero only if a(m^) < 
S^(ct(n)), which also implies |a^(m)| < b r . Thus, the second term in the expression of 
f V r [m]dp([G, r}) above is equal to 



/. YL cM (a; — a(mz±))mU(x)Ilz^,\m\(x) 
~ ExeN m #W^MW 

Suppose first that in' >i r -f m. Then, for any v ~ r, we have . 7*^[m] >i r ^ . 7^[m], 
as also afm'-i) = a(m^). Hence, we have 

ExeN( x - a ( m ^)) m ^( x )^H( a; ) E^gnO ~ a(m^))m^(a:)7e^[in](x) 



ExgN m '^( x )'R'ri ^ eN m^(x)7e^[m](x) 
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It follows 



2^ / M1? \ i / \ l(a(m^) < Srt(a(n)))dp([G,r]) 

. f \- Exen( x ~ a{m^))m'^{x)n^[m\{x) 

~ I £ E^WKSHW 1(a( " ,i^, * %(a, " ))) * [G ' rl) 

= / £ E^feW^W 1(55 * (/3(n)> < ^""^ 

= J £ £^5R5SW ifeMnlXJMWIW), 

where the last equality follows from by the Mass- Transport Principle. Given that <S^(/3(n)) < 
@{n^f.) implies |/3^(n)| > b r , Q^[n](x)n^(x) is non-zero only if <S^(/3(n)) < /3(n^), and 
|/3^;(n)| > b r implies <S^(/3(n)) < (3(n^), we have in fact 

\- E a -gN ( x - ^ (ff( n ))) M ( x ) -, / c \\ ^ o/ ^ \ 

£ ^CrtW^w 1<5f * (/3(n)) < 

in/a / \ I t h ^ ExeN^ - 5 ^(/3( n )))Q^[ n ](^)^(^) 

= KI %( „)I>M)E E^R^W 

= l(|/3 S (n)| > b, l)(b, - |%Q3(n))|)+ = (f, r - |a S (m')|)+, 
according to Lemma [H We thus obtained that 



J V r [m]dp([G,r]) < J F r (a(m))dp([G,r]) if m' 



>lrt m - 



The proof for the case m' <i r -|- m is identical. □ 

Proposition (BP estimate in unimodular random graphs; Proposition [TO]) . Let p € hi with 
b(p) < oo, we have 

hmtA^oo j V r [m^]dp([G,r}) = J V r [m^]dp([G,r}) = J F r (a^)dp{[G,r]) 

= inf f F r (cx)dp([G,r)), 

where F v (a) = mm(b v , \otg>\) + (b v - |a=^|) + - 

Proof. For any v € V, as T> v is continuous for inputs in V\ c , such as m( A ), hence we have 
lim tA_s-oo D„[m^ A ^] = D^rn^ 00 )] and the first equality follows by monotone convergence. 
Lemma [5] shows 

J V r [m^]dp([G,r]) = J F r (a(mW))dp([G,r]). 

It is clear that a(m^) satisfies a(m^°°^) = Sg o 5G(a(m' ra ')), as m(°°) satisfies equa- 
tion ([7]). Consider now a. € such that a. = Sg ° Sg(ck) and the measure p with marks a is 
unimodular. We define = 5 a ^ and m( fc+1 ) = QfjoT^Gijm^]. We have a(m^) = a for any 
k € N, and then necessarily m( fc ) >i r -j- m(°) (it suffices to look at the support of and to 
check this). It follows, because Qg °~R-g is non-decreasing, that (m( fc )) fcgN is a non-decreasing 
sequence, and thus / V r [m^]dp([G,r}) is non-decreasing in k (as T> r is non-decreasing). 
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We define m = lim tfeeN m( fe ). Clearly, a(m) > a and m = Qq o TZg(xxi). Moreover, by 
monotone convergence, we have 

limtfe^oo / Z> r [m(*>]dp([G,r]) = J V r [m]dp([G,r]) = J F r (a(m))dp([G,r]) 

> J V r [m.W]dp([G,r]) = J F r (a(mW))^([G,r]), 

where we used Lemma [5] for m and for m(°°) and Proposition [7] together with the fact T> r is 
non-decreasing to get the inequality. For any k & N we have m( fc+1 ) = QgoTZgI 111 ^] >irt m ^'> 
thus applying Lemma [5] we obtain that 

F r (cx)dp([G,r}) = j F r ( a (m^))dp([G,r]) > j V r [m^]dp([G,r]) 

/V*x>y V r [m^]dp([G,r}) > J F r ( a (m^))dp([G,r]), 

which completes the proof. □ 

5.4 From finite graphs to infinite unimodular trees 

The following proposition is not proved here. The interested reader can refer to [5], |26| and 
J, where similar results appear. 



Proposition (Asymptotic correctness for large, sparse random graphs; Proposition Let 

G n = (V n ,E n ) n be a sequence of finite marked graphs with random weak limit p concentrated 
on unimodular trees, with b(p) < oo. Then, 



,. 2M„ 
hm — — 

n— >oo V n 



V r [m^]dp([G,r]) = inf f F r (a)dp([G,r]). 

a=S G oS G (a) J 



5.5 Galton- Watson trees 

The main theorem follows quite straightforwardly from Propositions [10] and [TTJ The missing 
steps are standard and can be found in [18]; they resemble much the computation done in the 
proof of Proposition [lOj 

Theorem (Maximum allocation for bipartite Galton- Watson limits; [1]). Provided K[W A ] and 
E[W B ] are finite, the limit M(<fr A ,& B ) := \xm n -+ oa M{G n )/\A n \ exists and equals 



M($ A ,$ B )= infjE 

E[DA] 
E[D B ] 



mm 



E 



{w A ,EZ A iMc A )} 



c} 



where for alii, (Xi(c),Yi(c)) cGN is an independent copy of (X(c),Y(c)) c&N , and the infimum is 
taken over distributions for (X(c), Y (c)) cgN satisfying the RDE 



Y{c) 



D A 


c 




D B 


c 




w A -J2Mcf) 




C A = c^;X(c) = | 


w b -J2y*(c b ) 




r B 


1=1 







i=l 








Proof. Propositions 1101 and 1111 together give that 



2M(G n ) 



lim 

n^oo \A n \ + \B n \ a=S G 



inf / F r (cx)dp([G,r}) = [ V r [m^}dp([G,r}) 
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We introduce the probability measures p and p on IA by conditioning on the root being 
in A or B: p A ([G,r}) = p([G, r])l(r € A)^|±§^1, and similarly for p B . 

For A E M+, applying the MTP to p G U with f A (G, r, v) = E * ew ^4 Wgrj M^j to x ( r G ^ 
we obtain 

J %IV([5,-1) = "'"^B]'"" 1 / £/ A (C,r, r]) 

^S^M| ?/B(Gir> „ wlG , r]) 

= i^|/^ |m<J,! ' i ' ,B<[G ' rl » 

Letting A ->■ oo yields / V r [m^]dp A ([G, r\) = J V r [m^]dp B ([G, r]), which shows 

lim M^A = f V r [m^}d P A ([G,r}). 
n-^oo \A n \ J 



We then follow exactly the steps in the proof of Proposition [TU] for p A instead of p. This 
gives 

/ V r [m^]dp A ([G,r\) = inf ( / min(6 r , \ ag })dp A ([G,r]) 

J a=S G oS G (OL) I J 

+ i|^/^-i a rJ)^a G 'H)} 

As G is an unimodular tree, for any vertex v € V, all the components of a-^ can be chosen 
independently (as they are independent in a^-> \ which achieves the infimum). Then, for if 

dv 

incoming to v, ot-g is determined only from the subtree stemming from the tail of e ; furthermore 
it satisfies a-g = S-g o Sq-^[ol\. However, the distribution of the subtree at the tail of an e 
which is an input to Sg-f is the same as that of the subtree at the tail of e , by the two-step 
branching property of the bipartite Galton- Watson tree G. This implies that, for e incoming 
to a root r € A, a-g is solution of the two-step RDE given in the statement of the theorem. 
As detailed in Lemma 6 of [3], there is actually a one-to-one mapping between the solutions of 
a = Sg ° Sg[c*] on a Galton- Watson tree G and the solutions of the RDE considered here. This 
completes the proof. □ 



5.6 Cuckoo hashing 

Theorem (Threshold for (k, I, r)-orientability of /i-uniform hypergraphs; Theorem [2]). Let 
h,k,l,r be positive integers such that k,l > r, (h — l)r > I and k + {h — 2)r — I > (i.e. 
at least one of the inequalities among k > r and (h — l)r > I is strict). We define & A and $ B 
by (h, I, {r}) ~ <f> A and (Poi(rh),k, {r}) ~ § B , and 

T* hAltr = S n V {r:M^ A ^ B )<l}. 

Then, 

lim P (H n i Tn i /j is (k,l,r)-orientable) = 



f H/t< tI KI t 

I *f T > T h.k.l.r 
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Proof. For any /i-uniform hypergraph H n on n vertices, we let G n = (A n U B n ,E n ) be the 
associated bipartite graph, where B n contains the vertices of H n and A n the hyperedges of 
H n . Let \B n \ = n, and \A n \ = m = \jn\ for some r. First-of-all, it is clear by coupling that 
r H> M(® A , <l?f ) as defined in Theorem [TJ is a non-decreasing function. Let then t > ^ ^ r . 
Then, by Theorem [H we have 

lim < |, 

n— >oo \A n \ 

which immediately implies that G •n IS Bi.a. s. not (k, I, r)-orientable. 

Let now r < T^ klr . According to Theorem [1] again, we have linin._j.oo ^pjpp = I but there 
may still exist o(n) hyperedges which are not (I, r)-oriented. We will then rely on specific 
properties of H n ^ m ^ to show that a.a.s. all hyperedges are (I, r)-oriented. We follow here a 
similar path as in |1 4] 118] . It is easier to work with a different model of hypergraphs, that 
we call i? n>Pj /j, and that is essentially equivalent to the H n ^ Tn ^ h model [15]: each possible 
/i-hyperedge is included independently with probability p, with p = rh/ C?Ti) ■ 

We let f be such that r < f < t£ k lr , and consider the bipartite graph G n = (A n U B n , E n ) 
obtained from B n ^^ with p = t/i/(^~J). Consider a maximum allocation x € N En of G n . 
We say that a vertex of w £ A n (resp. a vertex w E B n ) is covered if Yle&dw = I (resp. 
YleGdw %e = k); we also say that an edge e G E n is saturated if x e = c. 

Let v be a vertex in A n that is not covered. We define K(v) as the minimum subgraph of 
G n such that: 

• v belongs to K (v); 

• all the unsaturated edges adjacent to a vertex in A n n K{v) belong to K(v) (and thus 
their endpoints in B n also belongs to K(v))\ 

• all the edges e for which x e > and that are adjacent to a vertex in B n n -fC(t> ) belong to 
ET(d) (and so do their endpoints in A n ). 

The subgraph K(v) defined in this way is in fact constitued of v and all the paths starting from 
v and alternating between unsaturated edges and edges e with x e > (we call such a path an 
alternating path). It is then easy to see that all the vertices in B n n K(v) must be covered, 
otherwise we could obtain a strictly larger allocation by applying the following change: take 
the path (ei, . . . ,e2t+i) between v and an unsaturated vertex in B n n K(v); add 1 to each x ei 
for i odd, and remove 1 from each x ei for i even; all these changes are possible due to the way 
the edges in K(y) have been chosen, and the resulting allocation has size larger by 1 than |x|. 

We will now show that the subgraph K (v) is dense, in the sense that the average induced 
degree of its vertices is strictly larger than 2. We first show that all the vertices in K (y) have 
degree at least 2. We have (h — l)r > I and v is not covered, hence v has at least two adjacent 
edges in G n which are not saturated, thus the degree of v in K(v), written deg^^y) v, is at least 
2. Let uu be a vertex in B n n K(y ). By definition, there is an edge e £ dw n K{v) through which 
w is reached from v in an alternating path, and x e < r. Then, because Yle&dw %e = k and k > r 
there must be another edge e' adjacent to w such that x e > > 0; such an edge belongs to K (v) 
and thus w is at least of degree 2 in K(y). Let now w be a vertex in A n n K (v), w ^ v. By 
definition, there must exist an edge e 6 dw n if (u) such that x e > 0. Because (h — l)r > I and 
i e > there must be another edge e' adjacent to w such that x e i < r; e' belongs to K(v) and 
thus degj^^) w > 2. 

Consider a path (e\ = (fif2), • •• ,&t = m ^(u) such that G A n n if(f) and any 

two consecutive edges in the path are distinct. We will show that at least one vertex out of 
2r consecutive vertices along this path must have degree at least 3 in K(v), by showing that 
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£ e 2(i+1)+1 < Xe 2l+1 provided v 2 (i+l) and u 2(i+i)+i have degree 2 in K{v) for all i. u 2 (i+l) 6 
-B„ n must be covered, so if &eg K ^ «2(i+i) = 2 we must have x e2{i+1) = k — x e2i+1 . Then, 
if deg^) u 2 (i+i)+i = 2 > ail tlie edges adjacent to « 2 (i+i)+i except e 2 ( i+ i) and e 2(i+1)+1 must be 
saturated, thus we must also have (h — 2)r + i e2(i+1) + x e2{l+1)+1 < This immediately yield 
! + {k + (/i — 2)r — /} < x e2i+1 , and thus 5 e2{i+1)+1 < £ e2i+ i as claimed. But x e2i+1 < r 



x 



e 2 (j + i) + i I V'" J — ^e 2i+ n ^ e2{ . +1)+1 

and so 5 e2i+2r+ i < — 1 if the hypothesis that all the vertices encountered meanwhile have degree 
2 in ) is correct, which is thus not possible. Note that we did not need to assume that the 
path considered was vertex-disjoint, hence it is not possible that K(v) is reduced to a single 
cycle. 

We will now count vertices and edges of K(y) in a way that clearly shows that the number 
of edges in K(v) is at least 7 times its number of vertices, with 7 > 1. We can always see K(v) 
as a collection P of edge-disjoint paths, with all vertices interior to a path of degree 2 in K(v) 
and the extremal vertices of a path having degree at least 3 in K(v). To form K(v) we would 
simply need to merge the extremal vertices of some of these paths. We have shown before that 
each path in P has at most 2r vertices. Let p = (ei = (t>it> 2 ), . . . , &t = (vt v t+l)) be a path in 
P, we let e E (p) = t be the number of edges in p and v (p) = Z £t€p dcg ^ v) Vi + de gx( ^ +1 be 
a partial count of the vertices in p (all the interior vertices are counted as 1 but the extremal 
vertices are only partially counted in 9y{p), as they belong to many different paths). We have 
^) = *- 1+ dii^r + dii^r <*-! + §• Hence, 



g£M> j >^>1 
6 v (p) ~ t - 1 + f - 1 - i 

Furthermore, it is easy to see that 

9e(j>) = number of edges in K (v), 

Oy(p) = number of vertices in K(y), 

which shows that the number of edges in K(v) is at least 7 = 1 1 1 > 1 times the number of 
vertices in K(v). 

Now, it is classical that any subgraph of a sparse random graph like G n with a number of 
edges equal to at least 7 > 1 times its number of vertices must contain at least a fraction e > 
of the vertices of G n , with probability tending to 1 as n — > 00 (see [EJQ3]). Therefore, K(v) 
contains at least a fraction e' > of the vertices in A n . 

There exists a natural coupling between B. n v h and Hn#,/i : we can obtain H n ^h from H n ^^ 
by removing independently each hyperedge with probability p — p > 0. This is equivalent to 
removing independently with probability p—p each vertex in A n . We let gap„ = l\A n \—M(G n ) = 
o(n). For any uncovered vertex v in A n we can construct a subgraph K(v) as above. If we 
remove a vertex w in A n n K(v) for such a v, then either this vertex w is itself uncovered, and 
then gap n is decreased by at least 1, or w is covered and then it must belong to an alternating 
path starting from v and we can construct a new allocation with size equal to that of x, and in 
which w is uncovered and there is one more unit of weight on one of the edges adjacent to v, 
hence removing w will also reduce gap n by 1. We proceed as follows: we attach independently 
to each hyperedge a of H n ^h a uniform [0, 1] random variable U a . To obtain H n ^ v h we remove 
all hyperedges a such that U a < p — p. This can be done sequentially by removing at each step 
the hyperedge corresponding to the lowest remaining U a . Then, at each step, assuming there 
are still uncovered vertices v in A n we can consider the union K of the subgraphs K(v), which 
has size at least e'rn. Hence, with positive probability the hyperedge removed will decrease the 
value of gap n . By Chernoff's bound, the number of hyperedges removed is at least rn^ 2 with 
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high probability as n — >■ oo, therefore gap n will reach with high probability as n — > oo before 
we remove all the hyperedges that should be removed. Hence, H n>Pt h (and thus H n ^ Tm ^ h ) is 
(k, I, r)-orientable a.a.s. □ 
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